Preprocessing Opportunities in Optimal Numerical Range Partitioning
نویسندگان
چکیده
We show that only the segment borders have to be taken into account as cut point candidates in searching for the optimal multisplit of a numerical value rangewith respect to convex attribute evaluation functions. Segment borders can be found efficiently in a linear-time preprocessing step. For strictly convex evaluation functions inspecting all segment borders is also necessary. With Training Set Error, which is not strictly convex, the data can be preprocessed into an even smaller number of cut point candidates, called alternations, when striving for optimal partition. Examining all alternations also seems necessary. We test empirically the reduction of the number of cut point candidates that can be obtained for Training Set Error on real-world data. The experiment shows that in some domains significant reduction in the number of cut point candidates can be obtained.
منابع مشابه
Solving Set Partitioning Problems with Constraint Programming
This paper investigates the potential of constraint programming for solving set partitioning problems occurring in crew scheduling, where constraint programming is restricted to not employ external solvers, as for instance integer linear programming solvers. We evaluate preprocessing steps known from the OR literature on moderately sized set partitioning problems. Further, we propose a new prep...
متن کاملRefinement Tree Based Partitioning for Adaptive Grids
We present a new partitioning algorithm for grids obtained by adaptive reenement. The method uses the adaptive reenement tree to obtain information unavailable to other partitioning methods which use only the nal grid and/or some geometric data. The algorithm requires (typically) O(log(N)) operations after an O(N) preprocessing step. The method is guaranteed to produce perfectly balanced connec...
متن کاملA Preprocessing Technique to Investigate the Stability of Multi-Objective Heuristic Ensemble Classifiers
Background and Objectives: According to the random nature of heuristic algorithms, stability analysis of heuristic ensemble classifiers has particular importance. Methods: The novelty of this paper is using a statistical method consists of Plackett-Burman design, and Taguchi for the first time to specify not only important parameters, but also optimal levels for them. Minitab and Design Expert ...
متن کاملOn Optimal End-to-End QoS Budget Partitioning in Network Dimensioning∗
We investigate the problem of optimal end-to-end QoS budget partitioning to quantify the advantage for network dimensioning of having a non-uniform allocation of end-to-end QoS requirement over the links in a path. We extend a previous revenue maximization model to M/G/1 queuing models and propose a fast partitioning heuristic based on the M/M/1 case. We then show on small networks with M/G/1 q...
متن کاملPartitioning strategies for the Block Cimmino algorithm
In the context of the block Cimmino algorithm, we study preprocessing strategies to obtain block partitionings that can be applied to general linear systems of equations Ax = b. We study strategies that transform the matrix AA into a matrix with a block tridiagonal structure. This provides a partitioning of the linear system for row projection methods because block Cimmino is essentially equiva...
متن کامل